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Abstract. In this article we consider the cycle structure of compositions of pairs of invo- 
lutions in the symmetric group S n chosen uniformly at random. These can be modeled as 
modified 2-regular graphs, giving rise to exponential generating functions. A composition 
of two random involutions in S n typically has about \fn cycles, and the cycles are charac- 
teristically of length y/n. Compositions of two random fixed-point-free involutions, on the 
other hand, typically have about logn cycles and are closely related to permutations with 
all cycle lengths even. The number of factorizations of a random permutation into two invo- 
lutions appears to be asymptotically lognormally distributed, which we prove for a closely 
related probabilistic model. This study is motivated by the observation that the number of 
involutions in [n] is \fn\ times a subexponential factor; more generally the number of per- 
mutations with all cycle lengths in a finite set S is n! 1-1 / m times a subexponential factor, 
and the typical number of /c-cycles is nearly n k ' m /k. Connections to pattern avoidance in 
involutions are also considered. 

1. Introduction. The purpose of this paper is to study the cycle structure of compositions 
of involutions. Recall that an involution is a permutation with all cycles having length 1 or 

2. Let a n be the number of involutions in the symmetric group S n . Then as n — > oo, 

(1) a n ~ v / ^!e^(87rer2)- 1 / 4 . 

This form involving \fn\ is due to [9, p. 583]; see [17] for the result in another form, and 
[T8J Example 3.2] for details of the asymptotic analysis by the saddle-point method. The 
factor \/n\ is much faster-growing than e v/ "(87rera)~ 1 / 4 . So in a logarithmic sense the number 
of involutions of [n] is approximately the square root of the number of permutations of 
[n]. Thus the number of pairs of involutions of [n] is logarithmically near n\. This suggests 
identifying permutations with pairs of involutions. A natural way to combine two involutions 
to form a permutation is composition, so we study compositions. We also seek other ways 
in which involutions and permutations are related by forming ordered pairs. In particular, 
Stanley- Wilf limits for various classes of pattern-avoiding permutations are known; in those 
cases where a Stanley- Wilf limit for the corresponding pattern-avoiding involutions exists, 
the former is the square of the latter. 

We begin by asymptotically enumerating permutations with all cycle lengths in a finite 
set S; involutions are the case S = {1,2}. Call a permutation with all cycle lengths in S 
an 5*-permutation. Let p n be the probability that a permutation of [n] selected uniformly 

i (S) 

at random is an S'-permutation. Then lim^oo ° gp ", = — l/(max5'); a refinement of this is 
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Theorem 13.11 below. In particular the number of fc-cycles of a typical S'-permutation is near 
I n fc /( max5 '), A typical involution of [n] has \pn fixed points. We then proceed to represent 
involutions graph-theoretically as partial matchings; thus compositions of two involutions 
can be identified with graphs having 2-colored edges, where each vertex has at most one 
incident edge of each color. The components of such graphs are paths and cycles, so we 
easily find generating functions involving them. This is our principal tool for extracting 
information on the cycle structure of these graphs and the corresponding permutations. In 
particular, if a and r are random involutions of [n], then as n —>■ oo: 

• The distribution of the number of fc-cycles of r o<j converges in distribution to V(l) + 
2V(l/2k) (Theorem EH); 

• The mean number of cycles of r o o (of all lengths) is y/n + | logn + 0(1) (Theorem 

• If cr and r are constrained to be fixed-point-free, then the distribution of the number 
of cycles of r o a is asymptotically normal with mean logn and variance 21ogn. 

Finally, we consider the number of factorizations of a permutation into involutions. The 
mean number of factorizations of tt into two involutions - that is, solutions to tt = r o a, 
with r and a involutions of [n] - is e 2v/ ™/v / 87ren(l + o(l)). We derive a formula (Theorem 
19. ip for the number of factorizations of it E S n into two involutions, in terms of the cycle 
type of tt. This is a product over cycle lengths. In a model of random permutations in 
which there are V(l/k) cycles of length k for k = 1, . . . ,n, the number of factorizations 
of a random permutation tt is lognormally distributed (Theorem 19.41) . If P* denotes this 
probability measure, F(tt) the number of factorizations of tt, and $ the standard normal cdf, 
then 



logF(vr)-i(log 



n 



:2 



lim K \ 2 3 " < x - <S>(x). 

n^oo y i (logn)' 3 / 

In particular the median number of factorizations of tt is near exp((logn) 2 /2), much smaller 
than the mean. This is one of many indications that the measure on S n coming from 
compositions of involutions chosen uniformly at random is much different from the uniform 
measure on S n . 

2. Pattern avoidance. We recall the Stanley- Wilf conjecture (now the Marcus- Tardos the- 
orem [JB|) on pattern avoidance. Let S n (7i) denote the set of 7r-avoiding permutations of 
[n]] then S n (ir) is bounded above by C n , for some constant C depending on tt. We call the 
smallest such C the growth rate of the pattern ir and denote it by L(ir). Arratia [I] has shown 
that the Marcus- Tardos theorem is equivalent to the existence of the limit lim^oo \S n (Tr)\ 1 / n , 
which equals L(tt), for all patterns tt. Now let J n (7r) denote the set of 7r-avoiding involutions 
of [n]. Then we can define the involutory growth rate of a pattern, L.;(7r) = lim^oo |/ n (7r)| 1 / n . 
This limit may not exist in general, but it does in some special cases, leading to the following 
conjecture. 

Conjecture 2.1. Let tt be a permutation pattern. Then L^tt) exists and L^tt) 2 = L(tt). 

Table [1] shows J„(vr), S^vr), and the ratio of their squares in cases when both are known. 
We note in particular that the conjecture is true for all patterns of length at most 3. We also 
note that Wilf-equivalence of two patterns is not the same as "involutory Wilf-equivalence" . 
In particular ^(vr 7 ")! = | iSVi^vr) | , where TT r is the reversal of tt, but it is not necessarily 
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rates are both known. C n and M n are the Catalan and Motzkin numbers, 
respectively; a^, are constants depending on k. 



true that |J n (7T r )| = |J n (7r)|. Counterexamples include 7r = 132 and tt = 12345; we have 
|/ n (12345)| ~ (7r 3 /8)4"n- 3 [IS] but 4(54321) ~ f 4 n n" 3 [5]. 

The pattern 1342 has growth rate 8 and the pattern 12453 has growth rate (1 + v^) 2 
[1]; the latter is the first known example of a pattern with non-integer growth rate. Bona 
has shown [H Lemma 5.4] that given a pattern tt of growth rate L(tt) = g 2 , the pattern 71' 
obtained by adding 1 to each element of tt and prepending 1 to the result has growth rate 
L(tt') = (g + l) 2 . In other words, this operation raises the square root of the growth rate by 
1; thus there is some precedent for studying a/ L(tt). Perhaps in general L^tt') = L^tt) + 1. 

Conjecture 12.11 can be restated probabilistically. The probability that a random permuta- 
tion of [n] is 7r-avoiding seems to be the square of the probability that a random involution 
of [n] is 7r-avoiding, multiplied by some asymptotically subexponential factor. (In the few 
known cases this factor is Cn~ k for some real constant C and nonnegative rational number 
k.) Thus involutions are, in general, more likely to avoid patterns than ordinary permuta- 
tions. This is because an involution is, in a sense, half a permutation. The RSK algorithm 
[22] takes a permutation tt to a pair of Young tableaux (P, Q); if tt is an involution then 
P = Q, so involutions can be identified with individual Young tableaux. The "graph" of a 
permutation tt is the set of points {(i, tt(z)) : 1 < i < n} and an involution can be specified 
by fixing only the points on or below the diagonal, identifying involutions with half-graphs. 

Finally, Egge has studied permutations with graphs which are symmetric under other 
reflections or rotations [7]. One might hope these lead to further generalizations of Conjecture 
12.11 To give an example, involutions invariant under the reverse complement are determined 
by one- fourth of their graph, and the number of such permutations which are also 132- 
avoiding grows like 2 n / 2 . Up to polynomial factors this is the fourth root of the Catalan 
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number C n , which is the number of 132- avoiding permutations. Wulcan |28j has enumerated 
involutions avoiding generalized patterns, including all the generalized patterns of length 
3; at this point no systematic review of the growth rates of the corresponding patterns in 
permutations has been undertaken. 

3. The number of permutations with all cycle lengths in some finite set. The fact 
that the number of involutions of [n] is approximately \fn\ can be generalized to permutations 
with cycle lengths lying in any finite set. We call a permutation with all cycle lengths lying 
in the set S an S'-permutation. The logarithmic asymptotics of S"-permutations are governed 
by the largest element of S. 

Theorem 3.1. Let S be a finite set of positive integers, with m = max S 1 , and such that the 

( s) 

elements of S do not all have a common factor. Let n\p n be the number of S -permutations 
of [n] . Then 

p^nl 1 ^ ~ ■C s n~ l ' 2+l ' 2m eMfs{n l ' m )) 
for some polynomial f$ of degree m — 1 and constant Cs which can be explicitly computed. 
In particular, 

\0gp { n ] , 

hm — — = —l/m. 

n->oo log TV. 

The condition gcd5* = 1 is a technical one required so that exp(^ sg5 z s / s) is Hayman- 
admissible. 

Proof. We apply Hayman's method [HUES] to the generating function f(z) = exp (J2 s( z S z s / s) . 
We have 

P (S) ^ 

where a(z) = X^es 2 ^' r ™ * s ^ ne positive real root of a(z) = n, and b(z) = ^2 s&s sz s . Using 
the Lagrange inversion formula, we can find an asymptotic series for r n in descending powers 
of n l l m . (See [25] for details.) From this we can determine the leading-term asymptotic 
behavior of f(r n ) and r"; we get f(r n ) = exp(n/m + cin^ m ~ 1 ^ m + ■ ■ ■ + c m n° + 0(n~ 1 / m )) 
and r™ = rW m exp(d 1 n( m " 1 )/ m + d 2 n^ m - 2 ^ m + ■■■ + d m n° + O^ 1 '" 1 ) for constants c k ,d k 
depending on S. Finally, b(r n ) ~ mn. So 

, S) exp(n/m + c^™- 1 )/™ + ■■■ + c m n° + 0(n~ l / m ) 

Pn ~ n n/m e xp(d 1 n( m - 1 )/ m H h d m 7l + C^n" 1 /™) 

and applying Stirling's approximation gives the result. □ 

To illustrate the theorem, consider S = {1,2,3}, so r n is the positive real root of z + 
z 2 + z 3 = n. This has asymptotic series r n = n 1 / 3 — | — |n~ 1//3 + ^n~ 2 ^ 3 + 0(l/n) for 
large n. From this we can find the leading terms ~ n n//3 exp(— n 2//3 /3 — 5n 1 / 3 /18) and 
f(r n ) ~ exp(n/3 + n 2 l 3 /Q + hn l ' 3 /9 - 5/18). Thus 

< S) exp (j + \n 2 / 3 + |nV3 - JL) 

Pn ~ 

n n / 3 yQwn 

and finally 



Corollary 3.2. The expected number of cycles of length k in an S -permutation chosen 
uniformly at random, where k G S and m = maxS 1 , is n k ^ m /k ■ (1 + o(l)) as n — > oo. 

This has also been shown by Benaych- Georges [2] and Timashev [23] . 

Proof. Let a n = n\p n be the number of S-permutations of [n\. The generating function of 
S-permutations by their size and number of /c-cycles is 



G^ s) (z,u) =exp Vz s /s + (u-l)z*/Jfe 




The mean number of fc-cycles in S'-permutations of [n] is therefore 
[*»](& G^(z,u)\ u=1 ) [ z %z k G^(z,l) lp 



n—k 



[z n ]G( s ){z,l) [z n ]G( s ){z,l) k p n ' 

Now, Pn-i/Pn ~ (n — l)!~ 1 / m /n!~ 1 / m = n 1 /" 1 , the subexponential factor in Theorem 13.11 
being slowly varying. So the mean number of /c-cycles is asymptotic to \ {n 1 ' rn ) k , as desired. 

□ 

The Boltzmann sampler [6] for S'-permutations provides an explanation for Corollary 13.21 
To generate random S-permutations, we fix a positive real parameter x and then pick a 
cycle type by taking V(x k /k) cycles of length k for each k G S. The cycles themselves 
are then populated with elements uniformly at random. Fixing x to be the positive root of 
Sfces zk = n ~~ ^hat * s > x = Tn _ gi ves permutations of expected size n, and all S-permutations 
of the same size are equally likely to be generated. The expected number of fc-cycles of a 
permutation generated by this process is r k /k. 

Alternatively, we could find the number of permutations with all cycle lengths in some 
set S with largest element m by summing over cycle types. For example, for involutions 
we have the sum J2 l+2k=n or YJk=o ( n -2k)\k\2^ ■ More generally, if S = {s u sj} with 
si < s 2 < ■ ■ ■ < Sj, then the number of S-permutations of [n] is given by 



(2) 



c l \...c j \s?...s c j 

-...+c,s,=n J 1 J 



where the sum is over j-tuples of positive integers (ci, ...,Cj). This summand can be ap- 
proximated as a multivariate Gaussian integral. It may be possible to do this integral using 
Laplace's method; Greenhill et al. [TT1 Thm. 6.4] give a version of this method adapted to 
sums over high-dimensional lattices. We do not carry out this computation as the method 
of Theorem 13.11 is effective. 

4. Graph-theoretic decomposition. An involution a can be represented as a partial 
matching on the set [n], where k and / are matched if c(k) = I (and therefore a (I) = k). We 
can view this matching as a graph, by drawing an edge between k and / when o~(k) = I. A 
pair of partial matchings or involutions, (a, r), can be identified with a graph on the vertex 
set [n] with 2-colored edges, where we color the edges solid or dotted according to whether 
they are from a or from r. We write a U r for this graph, and refer to it as as a superposition, 
and too for the corresponding permutation. 
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Theorem 4.1. The trivariate generating function for pairs of partial matchings (er, r), 
counted according to the size of the ground set (indicated by the variable z) and number 
of paths and cycles in o~Ut (indicated by u andv respectively), exponential in z and ordinary 
in u and v, is 

That is, n\[z n u k v l ]Q(z, u, v ) is the number of pairs of partial matchings on n vertices with k 
paths and I cycles. 

Proof. We enumerate the possible connected components of a pair of partial matchings and 
apply the exponential formula. 

The connected components of such a graph are cycles of even length and paths, with the 
edges alternating in color. These are the only possible components since if colors are ignored, 
all vertices must have degree at most two. We note that the degenerate path (a single vertex) 
and the degenerate cycle (two vertices connected by a solid edge and a dotted edge) are both 
possible components. We consider the length of a path to be its number of vertices, so a 
single vertex is a path of length 1. 

For n > 3, the number of labelled cycles of length n is ((n — l)!)/2; there are two ways to 
color a labelled cycle of even length with alternating edge colors, and zero ways to color a 
cycle of odd length. There is exactly one labelled cycle on two vertices. Thus the exponential 
generating function (egf ) for properly colored cycles is 

1t z 2 ,^ 4 r ,z 5 z 2 2 4 z 6 1 1 

1! 7T7 + 3! 77 + 5! 7T + --- = 17 + ^ + 7T + --- = 7: log 



2! ' "4! ' ~"6! ' 2 ' 4 ' 6 ' 2'"° I- z 2 

The number of labelled paths of length n is n\/2 for n > 2, and 1 for n — 1. There are two 
ways to color a path of any length - pick a color for a single edge and the coloring can be 
completed in exactly one way - except that there is only one way to color a 1-path. Thus 
there are n\ colored, labelled n-paths, for each n > 1, and the egf for properly colored paths 
is z/{l — z). 

The generating function of components marked according to their type (path or cycle) is 
therefore u ■ j 2 ^ + v ■ ~ log j^t, and applying the exponential formula gives □ 

We quickly derive two corollaries more relevant to permutation enumeration. 

Corollary 4.2. The exponential generating function of pairs of involutions is P(z) = exp(z/(l- 



0)/Vl 



z 



2 



Proof. Take the specialization u = 1 , v — 1 in Theorem 14.11 This gives the exponential 
generating function for pairs of partial matchings, which we identify with pairs of involutions. 

□ 

Corollary 4.3. The semi- exponential generating function of pairs of involutions (a, r), 
counted by the size of the ground set and the number of permutation cycles in the com- 
position t o a , is 

n K Z i U ) ~ (1 _ z 2)« 2 /2' 
6 



13 13 



5 1 2 



Figure 1 . Paths of length four and five, and a cycle of length four. 

Proof. Consider a pair of perfect matchings (a, r). Each connected component of the corre- 
sponding graph dUr gives rise to either one or two cycles in r o a. Each 2fc-cycle in the 
graph (T U r gives rise to two fc-cycles in the permutation r o a, corresponding to half of 
the vertices. Each fc-path in a U r gives rise to a fc-cycle in r o a. For example, the paths 
illustrated in Figure H] correspond to the permutation cycles (1342) and (13542) respectively, 
and the cycle to the pair of permutation cycles (13) (24). To count by permutation cycles, 
then, we need z n u k v l in Q(z, u, v) to be mapped to z n u k+21 ; thus we take the specialization 
R(z,u) = Q(z,u,u 2 ) in Theorem 14.11 □ 

5. Asymptotic distribution of the number of k-cycles. In this section we show 

Theorem 5.1. The distribution of the number of k-cycles of the composition of a pair of 
random involutions of[n] converges in distribution to the distribution of Ak + 2E>k as n — > oo ; 
where Ak and are independent, Ak is Poisson of mean 1, and E>k is Poisson of mean 
1/(2*). 

We need the following more general result. Recall that a sequence {b n } is slowly varying 
if limn-,*, b n ^/b n = 1. 

Lemma 5.2. Let P(z, u) be the generating function 

P(z,u) = V P n . k —u k 

n,k>0 

where P n ^ is the number of objects in a combinatorial class V of size n with some parameter x 
equal to k. Assume P(z,u) = Q(z)e R ^ z ' u ' with R a polynomial, [z n ]Q(z) is slowly varying as 
n — > oo, and R(l,t) is the factorial moment generating function of some distribution which 
is determined by its moments. For each n, let P n (x — k) — P n ,kJ J2k ^n,k define a probability 
distribution on the positive integers. Then as n — > oo, the sequence of distributions of x on 
P n converges in distribution to the distribution with factorial moment generating function 
exp R(l, t). 

Proof. Let j be the degree of R in the variable u. We can show by induction that d r u P(z, 1) = 
P(z, l)T r (z), where T r (z) has degree jr. Then we have 

F (M , _ [z n ]d:P(z,u)\ u=1 _ [z n ]P(z, l)T r (z) 

nllXJrj [z n ]P(z,l) [z n ]P(z,l) ' 

Now, lim^oo [z n ~ s ]P(z, l)/[z n ]P(z, 1) = 1 from the condition on slow variation. Then 

[z n ]P(z,l)T r (z) [z n -']P(z,l) 
[z"]P(z,l) K WW 
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and taking limits as n — > oo gives 



n-*oo [^]P(z, 1) ^ ; ' 

So lim^oo E n ((x) r ) = 7r(l). Now let F(t) = R(l,t). The rth factorial moment of the 
distribution with factorial mgf F(t) is F^ r \l). This is J^r exp R(z, u) | _ _, and we recall 
that d^Q(z) exp R(z,u) | u= i = P(z, l)T r (z) by definition. Therefore we have 



<9 r 

- exp P(z, u) 



du 



P(l,l)T r (l) =Tr(1) 



Q(i) 

which is what we wanted. □ 



Proof of Theorem I5.il We apply Lemma 15.21 with 

_,, , exp(z/(l - z)) ( ' k , 2 ^-2 2fc \ 

The slow variation hypothesis holds since [z n ] exp(z/(l— — z 2 = a? n /n\ = e 2 ^ / ™(8iren)~ 1 / 2 (l + 
o(l)). We have R(z,u) = exp((u — l)z h + (u 2 — l)z 2k /(2k)); this is the factorial moment 
generating function of + 2Bk, which follows from the fact that Poisson(X) has factorial 
mgf exp(A(t — 1)). Finally, we recall that if the moment generating function of a random 
variable has positive radius of convergence, then the random variable is determined by its 
moments [31 Thm 30.1]. A& + 2B k has mgf exp(e* — 1 + (e 2t — I) /2k), which is entire. □ 

The sum of Poissons given in Theorem 15.11 is quite natural. There are two types of 
components in superpositions of partial matchings on [n] that can lead to fc-cycles of the 
corresponding permutations: paths of length k (which induce one permutation fc-cycle) and 
cycles of length 2k (which induce two permutation fc-cycles). For large n and fixed k, the 
expected number of fc-paths approaches 1 and the expected number of 2/c-cycles approaches 
1/k. Furthermore, the sites in which individual cycles can appear are each rare, so it is not 
surprising to see an independent Poisson distribution for each type of component. 

6. Partial matchings with a specified number of fixed points. In this section we 
consider superpositions of partial matchings, o U r, where a is chosen uniformly from all 
partial matchings on [n] with k fixed points, and r is chosen uniformly from all partial 
matchings with / fixed points. 

Proposition 6.1. The expected number of r -paths in a U r is 

(n—k\ ( n—l\ Qr~l 

( 4 ) kl \ — )(r-l)/2( — )(r-l)/ 2 2 

(n) r 

if r is odd, and 

(t , W - l)(^)r/2-l(^),/2 + Kl ~ l)(^),/ 2 (^),/ 2 -l)2- 1 

if r is even. 



Proposition 6.2. The expected number of r- cycles in o~ U r, is 

I n—k \ I n—l \ or 
V 2 )r/2 V 2 Z 

1 j r(n) r 
/or even r. 

These statements can be easily verified. For odd paths, we compute the probability that a 
path occurs which traverses the edges 1,2, ... ,r in that order, and multiply by the number 
of possible paths. The argument is similar for even paths, except we must handle the cases 
where the two ends of the path are fixed points in a and fixed points in r separately. Finally, 
we do this for cycles; the most interesting feature is the factor of r in the denominator which 
arises from the symmetry of cycles. This model of random involutions, with n —>■ oo and k, I 
varying with n in such a way that k + I = fi(l) and k + / = o(n) simultaneously, has been 
considered in [20] in the context of dynamical systems. 

Corollary 6.3. The expected number of paths of length r in a U r, the superposition of 
two randomly selected perfect matchings on [n], where a and r each have pn fixed points, is 
asymptotic to p 2 {l — p) r_1 as n — ► oo. The expected number of cycles of length r (if r is 
even) approaches (1 — p) r /r as n — > oo. 

Corollary 6.4. Let r = 0(y/n) as n —>■ oo. Then the mean number of r-paths in a U r , 
where a and r are randomly selected involutions with y/n fixed points each, is asymptotic to 
exp(— r / y/n) as n — > oo, and the mean number of r- cycles is asymptotic to exp(—r/y/n)/r. 

These follow from Propositions 16.11 and 16.21 by making appropriate substitutions, and 
applying Stirling's formula in the case of Corollary 16.41 In both cases the number of r-paths 
decays exponentially in r. This can be explained by considering the process of path formation 
as a random walk from one fixed point to another on the complete graph K n . The length of 
a path is the time such a walk takes to reach some fixed point, which is geometric and has 
expectation the inverse of the proportion of points which are fixed points. 

The last of these results can be translated back into the terminology of involutions. To 
get a better sense of the scaling behavior of cycle sizes, we look at the expected number 
of a\/n-cycles of a composition of two random involutions. The expected number of k- 
cycles is (b n _ k + ^& n _ 2 fc) /b n , where b n = [z n ]exp(z/(l - z))/\Jl - z 2 . Recall that b n ~ 
e 2 ^(8iren)~ 1 / 2 . Let k = ay/n grow with n. One can compute that b n _ as /^ ~ e~ a as 
n — >• oo. Such square-root scaling is typical of structures counted by generating functions 
which are the exponential of a function with a simple pole. The simplest example may be 
the "fragmented permutations" (permutations with rooted cycles) or "sets of lists" counted 
by exp(z/(l — z)), which will be treated in more detail in |15j . 

7. The total number of cycles. The function R(z, u) given in Corollary 14.31 will be our 
jumping-off point for asymptotic results on cycle structure. We need the following asymptotic 
result. 

Theorem 7.1 (Wright). j23 Thm. 2 and Thm. 3] 

(a) The leading-term asymptotics for 

c n = [z n ](l-z)^(z)exp 
9 



where (3 is a complex number and §(z) is regular in the unit disk are given by 
1 



^ n /3/2+3/4 

(b) The leading-term asymptotics for 



exp(2 v ^)-^$(l)e 1 / 2 



{\+0{n-^ 2 )). 



[^ n ](log-^) (1 - z)"$(z) exp 



1 



u>zt/i a positive integer can be derived from that for the k = case by differentiating 
k times with respect to (3 and switching signs if k is odd. 

In particular, applying (a) with <&(z) = e~ z /y/l + z, f3 = —1/2 gives [z n ]P(z) = ^== exp(2y / n). 
This is consistent with the known number of involutions in ([1]). 

Proposition 7.2. The mean number of components which are paths in a superposition of 
two partial matchings on [n] selected uniformly at random is \/n + 0(1). 

Proof. The bivariate generating function counting superpositions of partial matchings by size 
and number of paths is Q(z, u, 1) = exp(iiz/(l— z))/ \J\ — z 2 . A standard result in generating 
functions is that the expectation of the parameter marked by u in such a bivariate generating 
function is [z n ]Q u (z, u, l)/[z n ]Q(z, u, 1) for objects of size n. Differentiating and applying 
Theorem 17.11 gives the result. □ 

Proposition 7.3. The mean number of components which are cycles in a superposition of 
two partial matchings on [n] selected uniformly at random is |logn + OinT 1 ! 2 logrz). 

Proof. The bivariate generating function counting superpositions of partial matchings by 
size and number of cycles is Q(z, l,v) = exp(z/(l — z))(l — z 2 )~ v / 2 ; differentiate and apply 
Theorem 17.11 □ 

Proposition 7.4. The mean number of elements in cycles in a superposition of two random 
partial matchings of [n] is \\/n + 0(1). 

Proof. The generating function counting pairs of matchings by their size and number of 
elements in cycles is 

/ z u 2 z 2 u 4 z 4 u e z 6 \ exp(z/(l — z)) 

^{— + — + — + — + -J= 

Again, we differentiate and apply Theorem 17.11 □ 

Theorem 7.5. The mean number of cycles in a composition of two uniform random invo- 
lutions on [n] is y/n + | log n + 0(1). 

Proof. A superposition of partial matchings with k paths and / (graph) cycles is identified 
with a composition of involutions having k + 21 (permutation) cycles. □ 

Proposition 7.6. The probability that a superposition of two partial matchings of[n] selected 
uniformly at random has no cyclic components is \/2n^ 1 ^ + 0(n~ 3//4 ) as n — > 00. 
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Proof. Partial matchings with no cyclic components have generating function Q(z, 1,0) = 
exp(z/(l — z)); thus the probability in question is 

[z n ] exp 

[z"]exp( T ^) /VT^ 

By Theorem ET] the numerator is e 2 ^/(2n 3 / 4 y / e7r)(l+0(ri~ 1 / 2 )); the denominator is e 2v/ ™/ ^/8iren 
0(n -1 / 2 )), giving the desired result. □ 

8. Fixed-point-free involutions. 

Proposition 8.1. The number of pairs of fixed-point-free involutions (a, r) on [2n] such that 
7r = r oa has 2ck k-cycles for each k is the same as the number of permutations of [2n] which 
have Ck 2k-cycles for each k, and no cycles of odd length. 

Proof. We construct a bijection between the two sets. Given such a pair of fixed-point-free 
involutions, the graph of a U r consists of graph cycles of length 2k, with the edges 
alternately solid and dotted. From each graph cycle we construct a permutation cycle. We 
need only make a choice of direction, say by starting at the smallest element and following the 
solid edge out of that element. This operation is clearly reversible; given a permutation with 
only even cycles we can reconstruct the graph crUr of a pair of fixed-point-free involutions. □ 

Proposition 8.2. The number of cycles in a composition of two fixed-point-free involutions 
on [2n] chosen uniformly at random has the distribution of 2 X^fc=i -^k, where is Bernoulli 
with mean l/(2k — 1) and the X], are independent. 

Proof. The distribution of the number of cycles of a permutation of [2n] with all cycle 
lengths even is that of Yuk=i -%k, where X^ is Bernoulli with mean l/(2k — 1) and the X\. are 
independent [U Thm. 3.7]. From Proposition 18. 1[ there are exactly the same number of 
permutations of [2n] with 2j cycles, all of even length, as there are pairs of fixed-point-free 
involutions (cr, r) G 5*2 n x 5*2 n with r o a having j cycles. □ 

Note that the expected number of cycles in a composition of two fixed-point-free involu- 
tions of [2n] is 2H 2n — H n = logra + (2 log 2 + 7) + 0(n~ 2 ), which differs from the expected 
number of cycles in a random permutation of [2n] by log 2 + 0(n~ 1 ). However, compositions 
of fixed-point-free involutions do not "look like" random permutations. Most obviously, a 
composition of fixed-point-free involutions of [n] has no cycles longer than n/2. Cycle lengths 
satisfy the following limit law. 

Proposition 8.3. Fix constants < 7 < 5 < 1/2. Let Pi(n; 7, 5) be the probability that 1 
is contained in a cycle of t o a of length between jn and 5n, where o and r are fixed-point- 
free involutions on [n] chosen uniformly at random. Then lim^oop^n; 7, 5) = ^/l — 27 — 
v / T^25. 

Proof. Call a permutation with all cycle lengths even an E'-permutation, and a composition 
of fixed-point-free involutions an /-permutation. The number of 2fc-cycles in i?-permutations 
of [n] is half the number of fc-cycles in /-permutations of [n], by Proposition ^. II In particular 
the number of elements of 2/c-cycles in /^-permutations of [n] and the number of elements of 
A;-cycles in /-permutations of [n] are equal. So the probability that a random element of a 
random /-permutation of [n] is in a cycle of length in [jn, Sn] is equal to the probability that 
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a random element of a random .E-permutation of [n] is in a cycle of length in [2jn, 25n] . By 
[T4"t Thm. 3.5] the latter probability approaches y/1 — 27 — yl — 25 as n — > 00. □ 

Proposition 8.4. e G (0,1/2). TTie expected number of elements in k-cycles in a 
composition of two random fixed-point-free involutions of [n] converges uniformly to (1 — 
2k/n)~ 1/2 as k/n — > 00 with < fe/n < 1/2 - e. 

Proof. By Proposition 16.21 the expected number of elements in r-cycles in a superposition 
a U r of fixed-point-free perfect matchings is 

(n/2)! 2 (n-r)l 
((n-r)/2)! 2 n! ' 

In the case r = an, this is asymptotic to 1/yl — a as n — > 00, with uniform convergence 
over < a < 1; this is shown in [T4"l Prop. 3.4], where the same expression occurs in relation 
to permutations with all cycle lengths even. Noting that elements in r-cycles in a pair of 
perfect matchings give rise to elements in r/2-cycles of the corresponding permutation gives 
the desired result. □ 



9. The number of factorizations of a permutation into involutions. The square of 
the number of involutions of [n] is a bit larger than n!; we have 



2 1 e 



nen 



The mean number of factorizations of a random permutation into a product of involutions 
is just the second factor. The number of factorizations can be as large as a n for the identity 
permutation, since id = a 2 for any involution a, and as small as n — 1 for those permutations 
which consists of an (n — l)-cycle and a 1-cycle. 

Theorem 9.1. Define the function 

(r -2j)!j!2J 

Let ix be a permutation of [n] with cycles of length k, for each k. Then 

n 

F(7T)=Hf(c h ,k) 
k=l 

is the number of factorizations of tt into two involutions, i. e. the number of solutions of 
71 = t o a with o and r involutions. 

We remark that f(r, k) is the number of partial matchings of [r] with fc-colored components. 
This interpretation is key to the proof, which works by pairing up some of the /c-cycles and 
then assigning one of k partial factorizations to each unpaired /c-cycle or pair of /c-cycles. 

We begin with the following special case. 



Lemma 9.2. The number of ways to factor an n-cycle it into two involutions is n. 
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Proof. Without loss of generality let 7r = (123 ■ ■ - n). We construct a corresponding pair of 
partial matchings (a, r) . This must be a path of length n, since cycles in a U r give rise to 
pairs of permutation cycles. So we consider an unlabeled path of length n with alternating 
solid and dotted edges, and attempt to label it. We begin by labelling some vertex by 1. 
Then follow the solid edge at that vertex, followed by the dotted edge at the next vertex, 
to determine the site of 2; repeat to determine the sites of 3, 4, and so on. The remaining 
vertices can therefore be labelled in exactly one way. □ 

For example, the cycle (1234) has the factorizations 

(a, r) = ((1)(24)(3), (12)(34)), ((13)(2)(4), (14)(23)), ((12)(34), (2)(13)(4)), ((14)(23), (24)(1)(3)). 

This is also a special case of a formula given in [121, Thm. 2.1] for the number of factoriza- 
tions of an n-cycle into permutations of types A and //. Note that if n = 2k + 1, then A and 
\i each have type 2 fc l; if n = 2k, either A has type 2 k ~ 1 l 2 and \x has type 2 h , or vice versa. 

Lemma 9.3. The number of ways to factor a permutation rr of [2n] consisting of two n- 
cycles into two involutions a, r, such that the corresponding graph a U r is a 2n-cycle, is 
n. 

Proof. Without loss of generality, let tt = (1, 2, . . . , n)(n+ 1, n + 2, . . . , 2n) in cycle notation. 
We draw a graphical cycle with 2n vertices, with edges alternately solid and dotted. Label 
some arbitrary vertex with 1; follow solid and dotted edges alternately around the cycle to 
place 2,3, ... ,n. Then label some arbitrary unlabeled vertex with n + 1 and follow solid 
and dotted edges alternately around the cycle to place n + 2, . . . , 2n. There are 2n 2 ways 
to carry out this procedure. However, the unlabeled 2n-cycle with alternately colored edges 
has 2n symmetries. So there are (2n 2 ) / (2n) = n distinct labellings; each one corresponds to 
a factorization. □ 

Proof of Theorem \9.1[ Given an arbitrary permutation 7r to be factored into involutions with 
7r = t o a, we can consider the cycles of each length separately. Consider the cycles of length 
k\ assume there are r of these. We pair up some of the /c-cycles with each other, representing 
that they come from the same cycle in the graph a U r. Those cycles which remain unpaired 
arise from paths, not cycles, in a U r. We then factor each unpaired cycle according to 
Lemma 19.21 and each pair of cycles according to Lemma 19.31 If there are j pairs of cycles, 
then there are r — 2j unpaired cycles, and thus r — j total components to factor; thus the 
number of such factorizations, once the cycles are paired up, is k r ~K The number of ways 
to find j disjoint pairs of cycles, with order irrelevant, is 

0(7) -m n 

j! (r-2j)Ij!2i' 
Summing over j gives the function f(r, k) defined in the theorem. □ 

It appears that the distribution of the number of factorizations of a random permutation 
of [n] into involutions approaches a lognormal distribution as n — > oo. We consider the 
following probability model: let = V(l/k) for k = 1,2, ... ,n, where n is a positive 
integer parameter. Let m = X\ + 2X 2 + • • • + nX n , and take a permutation of [m] with 
Xk cycles of length k for each k, chosen uniformly at random from all permutations of 
that cycle type. We denote the corresponding measure on the set of all permutations by 
IP* . Then E (X)£=i kXk) = n. This model generates each permutation of [m] having all 
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cycle lengths less than or equal to n with probability e~ Hn /ml, where H n = Y?k=i ^ s a 
harmonic number; in particular for each m < n, each permutation of [m] occurs with the 
same probability. 

Theorem 9.4. As n — > oo ; 

UmP .( W))-> g ")% ,)^, 

Proo/. First, we show that E(log/(X fc , fc)) = log(fc)/A; + 0(£r 3 ). Let = E(log/(X fc , fc)). 
We can write the expectation as a sum over possible values of X k , giving 

(7) Ai^e-V^-i-log/Cr,*). 

r>l 

We can derive an asymptotic series for log/(r, fc) from the Taylor series for log(l + a;) around 
x = 0; this gives an asymptotic series for the rth term in (JTj), which is of order k~ r \ogk. 
Adding these gives fi k = (log k)/k + (1/2A; 3 ) + 0{k~ 4 ). Similarly let h k = E((log /(X fc , A;)) 2 ); 
then in like manner we can derive the series h(k) = (\ogk) 2 /k + (logk) 2 /k 2 + 21og/c//c 3 + 
0{k~ A ). The variance is given by o\ = Y(\ogf(X k ,k)) = h{k) — /i 2 , and we find <j\ = 
{log k) 2 /k + 2\ogk/k 3 + 0(\ogk/k A ). 

Next we show that X^ =1 /ifc ~ (logn) 2 /2 and Y^k=\ °\ ~ (logn) 3 /3 as n — > oo. We have 
Hk = {\ogk)/k + 0(k~ 3 ). Now, ELi( lo g fc )A ~ $™{\ogk)/kdk = |(logn) 2 , where the 
asymptotic equality can be justified by the Euler-Maclaurin summation formula. Expanding 
the big-0 notation, \jj, k — (\ogk)/k\ < Ck~ 3 for some constant C, so YskLi^k ~ (}ogk)/k 
converges. Therefore Y^l=i^k ~ Efc=i(l°g^)A ~ |(l°g n ) 2 - The proof for Ylk=i °\ ^ s 
similar. Note that J2k=i^k = E(logF(7r)) and Y^k=i a 1 = V(l°g-^( 7r ))- Finally, we apply 
Lyapunov's central limit theorem to show that logF(7r) is asymptotically normal. We recall 
the theorem: let Yi, I2, . . . be independent random variables with finite mean and variance, 

E(Y n ) = /i n and V(Y n ) = a 2 . Let s 2 = ELi^fe- If for some 6 > °> E (l y fc| 2+<5 ) is finite for 
k = 1,2, . . . and the Lyapunov condition 

1 n 

(8) lim ^VE(|r fe -En| 2+5 ) = 



00 S 

n k=l 



is satisfied, then the standardization (Ylk=i(Yn — Hn))/ S n converges in distribution to a 
standard normal random variable as n — > 00. We will take 5=1, and Y*. = log f(X k , k). 
As previously shown, s 2 ~ (logn) 3 /3, so s 3 ~ (logn) 9//2 /(3v^3)- We also observe E(|Y" fc | 3 ) is 
finite for each k. To check (jSJ), first note that 

E(|n - En| 3 ) = J2 [( lQ g Z( r ' fe ) - EF fe ) 3 P(X fc = r)] + (En) F(X k = 0). 
Since WY k is positive, this is less than 

^(log/(r,fc)) 3 P(X fc = r) 



r>l 



+ (EY k )F(X k = 0). 



The first term in this equation is in fact E(Y fc 3 ). (The sum giving E(Y fc 3 ) should naturally be 
over r > 0, but f{0,k) = 1 and so the r = term does not contribute to the sum.) Therefore 
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we have 

E(|y fc - EY k \ 3 ) < E(Y fc 3 ) + (EY k )F(X k = 0) < E(Y 3 ) + E(Y k ). 

But E(Y k 3 ) ~ (log£;) 3 /A; and EF fe ~ (logJfe)/Jfe as jfe -> oo. so E(|Y" fc - EF fc | 3 ) ~ (log/c) 3 /*;. 
Therefore we have 

(logn) y / 2 4 vl°g n 

and in particular this goes to as n — > oo, so (jHJ) is satisfied. Therefore the standardization 
of log F(tt) converges in distribution to the standard normal, as desired. □ 

Simulation experiments lead to the following conjecture. 

Conjecture 9.5. Let ir be a permutation of [n] chosen uniformly at random. There exists 
a positive constant c ~ 0.16, such that 

n^oo y c(logn) d / 

where is the distribution function of the standard normal. 

We can refine Theorem 19.11 to count the number of factorizations n = r o a where a and 
t are involutions with s and t fixed points, respectively. This requires determining all the 
possible unlabeled graphs on [n] with properly 2-colored edges which can be labeled to give 
two involutions which compose to a permutation with the cycle type of 7r, and then counting 
the labellings which actually give tt. This is impractical for large s and t. The fixed-point-free 
case, though, is straightforward. 

Proposition 9.6. Let C\,C2, ■ ■ ■ , c n be nonnegative even integers with Y^=i k°k — n>- Then 
the number of factorizations of a permutation ir of type l Cl . . .n Cn into two fixed-point-free 
involutions is 



JJ(c fc - 1)HW 2 



k=l 

where we adopt the convention (—1)!! = 1. 

Proof. The graph a U r corresponding to such a factorization consists of c k cycles of length 
2k, for each k. The permutation fc-cycles can be paired up into graphical cycles in (c k — 1)!! 
ways. Each pair of permutation fc-cycles thus obtained can be used to label a graphical 
cycle in any of k ways, following Lemma 19.31 Thus the number of ways to arrange the 
elements of fc-cycles of % in the graphical representation is (c k — l)!!A; Cfe//2 . The total number 
of factorizations is just the product over cycle lengths. □ 

We note that if any of the c k are odd, then 7r has no factorizations into fixed-point-free 
involutions. Furthermore, the proportion of permutations of [n] having all c k even (that is, 
an even number of cycles of each length) is G(n -2 ). The details of this enumeration and the 
cycle structure of such permutations will be considered in 
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